Combined i-Vector and Extreme Learning Machine Approach for Robust Speaker Identification and Evaluation with SITW 2016, NIST 2008, TIMIT Databases

نویسندگان

چکیده

In this article, a novel combined i-vector and an Extreme Learning Machine (ELM) is proposed for speaker identification. The ELM chosen because it fast to train has universal approximator property. Four combinations of features based on Mel Frequency Cepstral Coefficient Power Normalized are used. Besides, seven fusion methods exploited. system evaluated with three different databases, namely: the SITW 2006, NIST 2008, TIMIT database. This work employs 2016 database first time identification using integration between approach. From each database, 120 speakers 1200 speech utterances used (overall 360 3600 utterances). Furthermore, comprehensive evaluations exploited wide range realistic background noise types (Stationary AWGN Non-Stationary Noise types) handset effect. compared Gaussian Mixture Model-Universal Background Model (GMM-UBM) other states art approaches. results show that method outperforms GMM-UBM approach state- of-the-art under specific conditions, techniques can be improve robustness effects.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The NIST 2014 Speaker Recognition i-Vector Machine Learning Challenge

During late-2013 through mid-2014 NIST coordinated a special machine learning challenge based on the i-vector paradigm widely used by state-of-the-art speaker recognition systems. The i-vector challenge was run entirely online and used as source data fixed-length feature vectors projected into a low-dimensional space (i-vectors) rather than audio recordings. These changes made the challenge mor...

متن کامل

The 2016 NIST Speaker Recognition Evaluation

In 2016, the National Institute of Standards and Technology (NIST) conducted the most recent in an ongoing series of speaker recognition evaluations (SRE) to foster research in robust text-independent speaker recognition, as well as measure performance of current state-of-the-art systems. Compared to previous NIST SREs, SRE16 introduced several new aspects including: an entirely online evaluati...

متن کامل

CRSS systems for the NIST i-Vector Machine Learning Challenge

This paper describes the systems developed by the Center for Robust Speech Systems (CRSS), Univ. of Texas Dallas, for the National Institute of Standards and Technology (NIST) iVector challenge. Since the emphasis of this challenge is on utilizing unlabeled development data, our system development focuses on: 1) unsupervised clustering methods to estimate development data labels; 2) build effic...

متن کامل

UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation

This study describes systems submitted by the Center for Robust Speech Systems (CRSS) from the University of Texas at Dallas (UTD) to the 2016 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE). We developed 4 UBM and DNN i-vector based speaker recognition systems with alternate data sets and feature representations. Given that the emphasis of the NIST SR...

متن کامل

designing and validating a textbook evaluation questionnaire for reading comprehension ii and exploring its relationship with achievement

در هر برنامه آموزشی، مهم ترین فاکتور موثر بر موفقیت دانش آموزان کتاب درسی است (مک دونو و شاو 2003). در حقیقت ، کتاب قلب آموزش زبان انگلیسی است( شلدن 1988). به دلیل اهمیت والای کتاب به عنوان عنصر ضروری کلاس های آموزش زبان ، کتب باید به دقت ارزیابی و انتخاب شده تا از هرگونه تاثیر منفی بر دانش آموزان جلوگیری شود( لیتز). این تحقیق با طراحی پرسش نامه ارزیابی کتاب که فرصت ارزیابی معتبر را به اساتید د...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Circuits Systems and Signal Processing

سال: 2021

ISSN: ['0278-081X', '1531-5878']

DOI: https://doi.org/10.1007/s00034-021-01697-7